Modelling time to event with observations 
made at arbitrary times 

X> : MATTHEW SPERRIN* 

<D ■ 

Department of Mathematics and Statistics, Lancaster University, UK 



m.sperrin@lancaster.ac.uk 

IAIN BUCHAN 



North-west Institute for Bio-Health Informatics, 
School of Community Based Medicine, 

•i— > \ 

\ University of Manchester, UK 

January 12, 2013 

(N 
(N 

m 

g ■ Abstract 

We introduce new methods of analysing time to event data via 
extended versions of the proportional hazards and accelerated failure 
time (AFT) models. In many time to event studies, the time of first 
observation is arbitrary, in the sense that no risk modifying event 
occurs. This is particularly common in epidemiological studies. We 
show formally that, in these situations, it is not sensible to take the 
first observation as the time origin, either in AFT or proportional 
hazards type models. Instead, we advocate using age of the subject 
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as the time scale. We account for the fact that baseline observations 
may be made at different ages in different patients via a two stage 
procedure. First, we marginally regress any potentially age- varying 
covariates against age, retaining the residuals. These residuals are 
then used as covariates in the fitting of either an AFT model or a 
proportional hazards model. We call the procedures residual accel- 
erated failure time (RAFT) regression and residual proportional haz- 
ards (RPH) regression respectively. We compare standard AFT with 
RAFT, and demonstrate superior predictive ability of RAFT in real 
examples. In epidemiology, this has real implications in terms of risk 
communication to both patients and policy makers. 

Keywords: accelerated failure time, age, proportional hazards, survival 
analysis, time origin, time varying covariates. 



1 Introduction 



Two common methods for anal ysing time to e v ent d ata are proportional haz 



ards models (see, for example, 



Cox Sz Oakesl. 



time (AFT) models (see, for example, 



We 



19841 ) and accelerated failure 



19921 ). When constructing these 



models, a suitable time scale must be chosen. For example, time may be mea- 
sured as age of the subject, or on the other hand it may be measured as the 
subsequent time to event after an intervention or observation. In the latter 
case, age at study entry is often included as a covariate. If no risk-modifying 
event occurs at the time of first observation, this time is somewhat arbi- 
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trary ( IFieberg fc DelOiudicel . Il997t iThiebaut fc Benichoil 12004 ) . Despite 
this, time-in-study is often used as the time scale in epidemiological studies. 

The choice of time origin in survival analysis can be a difficult issue, 



but t his choice is an essential part of any survival analysis ( iCox fc Oakes 



19841 ) . It can be seen as a question of aligning individuals under consider- 



ation in a sensib 



e wa y by choosing 'time zero'. This is first considered in 



Farewell &: Cox! (119791 ). who compare survival models for breast cancer in- 
cidence in women, taking the time origin as age, and alternatively as the 
time since birth of the first child. They find that tak i ng tim e as age offers 



the most parsimonious solution. 



Liestol fc Andersen! (12002! ) also argue for 



the use of age or calendar time as the time origin when a time of incidence 
or initial observation is difficult to define, such as the situations we consider 



here. IFieberg &; DelGiudicd (jl997f ) strongly advocate the use of age as the 
time origin in epidemiological studies, indeed they declare that using time 
since first observation as the time scale is 'incorrect'. 

Using different time scales will lead to different estimates of parame- 



ters in the time to event model. 



Thiebaut fc Benichoul (120041 ) carry out 



an extensive simulation study on this issue, and also strongly advocate us- 
ing age as the time or i gin ra ther than time on study. On the other hand, 



Fieberg Sz DelGiudicd (12009! ) advocate start of study as the most sensible 
time origin, following comparison of survival estimates using three different 
time origins: age, start of study and calendar time. 



We do not discuss in this paper the related issue of having multiple scales 
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in wh ich time can be measured, for example age and mileage of a car ( jOakes 



1995|). 



Suppose we have n subjects. Denote by Sj the age of individual i, which 
obviously changes with time. Suppose the individual is first observed at age 
Oj, at which point a series of covariates are extracted. Denote covariates that 
are not expected to vary with age by x iy and those that are expected to vary 
with age by Zi iSi , where the dependence on age is made explicit. The values 
of the time varying covariates that we actually observe are given by z^ . Let 
ti denote the time that has elapsed since individual i was first observed. 

This paper makes two contributions. First, we show formally why mea- 
suring time since observation, ti, should be rejected in situations where no 
intervention is made at the time of observation. Second, we propose instead 
that the age of the individual, Sj, should be used as the time scale. As an 
extension to proportional hazards models, we propose a two-stage procedure 
called residual proportional hazards (RPH) regression, in order to correct for 
the fact that individuals' age-varying risk factors are observed at different 
ages <2j. Similarly, we propose the corresponding extension to the accelerated 
failure time model, which we call residual accelerated failure time (RAFT) 
regression. 

The paper is organised as follows. In Section[2]we justify formally why we 
reject the idea of measuring time as time since first observation. In Section 
[3] we introduce RPH and RAFT regression. In Section H] we demonstrate the 
performance of RAFT regression compared to existing methods of analysing 
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2 Measuring Time Since First Observation 

In this Section we justify our stance that time should be measured as the 
age of the individual, Sj, rather than as time since first observation, U . This 



is a stance shared by 



Fieberg fc DelGiudice 



many others (see, for example 



1997 



Liestol fc Andersen 



2002 



Breslow et al 



1983 



Thiebaut fc Benichou 



20041 ). We show, however, that using time-in-study as the time scale in- 
evitably leads to elementary contradictions in situations where both the entry 
to study does not coincide with a risk modifying event, and age is expected 
to influence the time to the event of interest. 

With t denoting time to event, as measured from first observation, let m(t) 
denote the median time to event, or some other similar midpoint estimate of 
time to event (MTE). Let s denote age of an individual. Consider comparing 
two individuals, i and j, who are identical with respect to all their covariates 
except age. Without loss of generality assume that one individual is observed 
at age Sj = a, and the other is observed at age Sj — 0. At least in situations 
where t is the time after an observation that does not include an intervention, 
it seems reasonable to make the following two demands. 



The older person should have a shorter MTE: 



m(t\a) < m(t\0). 



(1) 
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When the event is naturally more common with advancing age, with 
all other things being equal, this is a reasonable assumption. Such is 
the case with common endpoints, such as death or disease incidence in 
epidemiological studies. 

• The fact that the older person has survived to age a while avoiding 
the event should make his age at the MTE greater than the age at the 
MTE of the younger person. This can be expressed as the inequality 



This requirement might be more subtle, but is logical in situations such 
as the following: suppose the event of interest is death from a chronic 
disease, and time is measured from a clinic appointment at recruitment 
to the study. Suppose we have two subjects, one older than the other, 
whose risk factors are otherwise identical. Then since the younger 
person is exposed to hazard between their current age, and the age of 
the older person, their expected age at death should be less. 

Moreover, if we use time since first observation as the time scale, we should 
usually require that age should be included as a covariate in the model, since 
all other things being equal we would expect a failure event to occur to an 
older person sooner than a younger person. We will now show that in a model 
where time is measured as time since first observation, it is not possible to 
both include age as a covariate, and satisfy the two requirements specified 




(2) 
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by inequalities (CEJ) and fl2]). We demonstrate this result for both AFT models 
and proportional hazards models. 



2.1 Proportional Hazards Model 

Consider a proportional hazards model where age at observation, a, is the 
only covariate under consideration. This would be given by 



for an individual of age a, for some /3. Common choices for the age transfor- 
mation function /(■) are f(a) = a and f(a) = log(l + a) (where we add one 
to avoid the issue of taking logs of zero). Other covariates can be considered 
absorbed by the baseline hazard ho. We are agnostic as to whether the base- 
line hazard is parametric or not. The survival function S(t; a) is related to 
the hazard function via 



h(t; a)=exp{Pf(a)}h (t) 



It 



log S{t; a) 



h(t;a), 



and hence the proportional hazards model implies that 



S(t; a) = So(t; a) 



exp{/3/(a)} 
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where So denotes the baseline survival function. A general form for the 
median of the proportional hazards model is therefore given by 

S-\0.5) = Sq 1 [o.5 ex P{-W°»] . (3) 

Suppose that /(a) = a. See the remark at the end of this Section for /(a) = 
log(l + a). 

Claim 1. In order for the proportional hazards model to satisfy inequality 
(f7]j ; we must have f3 > 0. 

Proof. Substituting into inequality ([!]) gives 

We consider the consequences of this inequality holding for the value of f3. 
First we can apply the baseline survival function Sq to both sides, as Sq is a 
decreasing function, yielding 

0.5 < 0.5 exp( "^ a) . 

Taking logs of both sides and dividing through by log(0.5) (which is negative) 
yields 

exp(— (3a) < 1, 

from which we conclude that we require (5 > to satisfy inequality ([!]). □ 
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Claim 2. In order for the proportional hazards model to satisfy inequality 
IIP, we must have f3 < 0. 



Proof. Substituting Q into inequality ([2]) gives 

Sq\0.5) - a < Sq 1 ^^-^}. 

Again, we consider the consequences of this inequality for (3. Taking So of 
both sides gives 

S {So 1 (0.5) -a} > 0.5 cxp( ~^ a) . 
Taking Taylor expansions of the left hand side about S' ~ 1 (0.5) gives 

0.5 - na + 0(a 2 ) > O.S ^"^, 

where k = S' {S' " 1 (0.5)}. After standard manipulations, we then arrive at 



13 < — k>£ 
a 



log {0.5- Ka + 0(a 2 )} 
log(0.5) 



(4) 



We next note that inequality fl2]) should hold for any reasonable a, and cer- 
tainly for small values of a, so take limits as a l of the right hand side of 
(jlj). This limit can be calculated using l'Hopital's rule, 



aio g(a) g'(0) 
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taking 

"log {0.5- na + 0(a 2 )} 



/(a) = - log 
Applying this, we get 



log(0.5) 



g{a) = a. 



Finally, recall that k = Sq^Sq 1 (0.5)}. It is a derivative of a survival func- 
tion, and therefore must be non-positive. Since log(0.5) is negative, overall 
the right hand side of ([5]) is non-negative. As a derivative, k can be made 
arbitrarily small by rescaling the time axis. Such a time rescaling affects 
the baseline hazard, ho(t) only, and not the magnitude of f3. Therefore we 
conclude (3 < to satisfy inequality (T2]). □ 

Combining Claims [1] and [2] we must conclude that, in order to satisfy 
inequalities ([1]) and fl2]), we must have /3 = 0. In other words, having an age 
covariate is inconsistent with these requirements. 

Remark. Setting f(a) = log(l + a) gives the same result. This is clear in 
Claim [H For Claim [21 the argument would proceed identically up to (TjJ. 
The only difference is that we have g(a) = log(l + a) rather than g(a) = a 
for l'Hopital's rule, but these both yield the same g'(0). 



2.2 Accelerated Failure Time Model 

An accelerated failure time (AFT) model is typically a log-linear model for 
the survival time. (Or, if not the survival time directly, the scale parameter, 
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which is proportional to the survival time). Consider an AFT model where 
time is measured as time since first observation, and age, a, is the only 
covariate under consideration. This would be given by 



Here, a is used to denote any other terms in the model, so may depend on 
background risk factors, etc. As before, the age transformation function /(•) 
is commonly the identity, f(a) = a; or a log transform, f(a) = log(l + a). 
As before, we consider comparing two individuals, i and j, one of age Sj = a, 
the other of age Sj = 0, with all other covariates identical. 
The following lemma will be useful. 

Lemma 3. In the AFT model of Equation making an arbitrary change 
of factor k to the time scale will alter the value of a but not j3. Therefore 
make explicit this dependency by writing a(k). Then moreover, for any e > 
there exists a scaling factor k e such that exp { — a(fc e )} < e. 

Proof. Rescaling time by factor k leads to 



m(t) = exp {a + /?/(a)}. 



(6) 



m(kt) = km(t) = A; exp {a + /3/(a)}. 



Setting a' = log (A;) + a gives 



m(kt) = exp {a' + (3f(a 



>} 
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proving the first part of the lemma. For the second part, it suffices to take 



As in the previous Section, we now consider the case where f(a) = a, and 
relegate comments about the case f(a) = log(l + a) to a remark at the end 
of this Section. 

Claim 4. In order for the accelerated failure time to satisfy inequality (QP, 
we must have (3 < 0. 

Proof. Substituting §6§ into inequality ([T|) gives 



Claim 5. In order for the accelerated failure time to satisfy inequality 
we must have (3 > 0. 

Proof. Substituting §6§ into inequality ([2]) gives 



k e < eexp(a). 



□ 



exp(a + (3a) < exp(ct) 



which immediately gives f3 < 0. 



□ 



exp(a) — a < exp(a + (3a). 



Re- arranging, 



0> 



log(l — ae a ) 



a 
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Now we require this for all a > 0; taking limits as a 4- yields 

(3 > -e~ a . 

However, by Lemma |3l for every (3 < there exists a time rescaling that 
gives an a violating this inequality, hence we must conclude (3 > 0. □ 

Combining Claims H] and [5] we must conclude that, again, in order to 
satisfy inequalities (JTJ and (J2J), we must have (3 = 0. 

Remark. The same solution is reached for f(a) = log(l + a). This is trivial 
for Claim HI and for Claim |5] it follows because log(l + a) ~ a for small a. 

The problems of having two measurements of time i n a survival model, 
both advancing at the same rate, are alluded to by 



Liestol fc Andersen 



(120021 ) . who call this an 'exchangeable' effect, i.e. there is an information 



redundancy between the two time measurements, Sj and U. 



2.3 Example 



Wilson et al 



(120081 ). They 



The following example is based on the model of 
construct a log-linear model for the scale parameter, X(x) say, of a Weibull 
density, which is proportional to the expected survival time, where the out- 
come is heart attack or stroke. Survival time is measured as time since an 
observation is made, and age at observation is included as a covariate. Pre- 
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cisely, their model is 

X(x) = exp{14.9756 — 0.0159 x body mass index 

— 0.0571 x age in years — 0.4959 x smoker 

— 0.0070 x systolic blood pressure 

— 0.1432 x total to HDL cholesterol ratio 

— 0.3421 x diabetic + 0.5139 x male}. 

Consider a reasonably healthy person, say body mass index = 20, a non- 
smoker, systolic blood pressure = 120, total to HDL cholesterol ratio = 3.5, 
non diabetic and male. An example of a counterintuitive result is that a 50 
year old with the above risk factors and characteristics has 



P [event over next 25 years] = P [event between age 50 and 75] = 0.147. 

Yet, a 55 year old with the same characteristics has 

P [event over next 20 years] = P [event between age 55 and 75] = 0.159. 

This is clearly rather odd, as one could imagine this is the same person five 
years later, who remains in good health and has not had the event, yet their 
probability of an event by age 75 has increased. This demonstrates when it 
is not sensible to allow inequality (J2J) to be violated. 
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Many risk factors are expected to rise with advancing age. For example, 
blood pressure and cholesterol generally increase with age. This could be 
problematic for a time-to-event regression where age is time, since the levels 
of some risk factors may depend upon the age of the individual when they 
were observed, aj. Not adjusting for this would give people observed at an 
older age a worse prognosis, simply because their risk factors were higher 
when measured. A younger person may get a better prognosis as his risk 
factors are lower when measured, despite the expected trajectory of those 
risk factors being higher than the older person. It therefore makes sense to 
correct for the age at which the risk factors have been measured. 

We propose a two-stage procedure. In the first stage, risk factors that 
vary with time are regressed marginally with age, and the residuals from this 
regression retained. In the second stage, time to event regression is carried 
out (via either AFT or proportional hazards), but using the residuals of the 
time varying risk factors as covariates rather than their individual values. 

3.1 Stage 1 

Using the same notation as before, for each individual we have a pair (a,, Zi tCH ), 
consisting of the age at which the first observation was made, and the val- 
ues of the age-varying risk factors at that age. For simplicity of exposition, 
suppose there is just one age- varying risk factor, Zi )(H . Suppose that the 
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age-varying effect can be captured by a function / that is common to all 
individuals i = 1, . . . , n: 



and of course, this is true in particular for Sj = aj. We can therefore con- 
sider estimating / by regressing {z itCli , i = 1, . . . , n) on {di, i = 1, . . . , n}. 
For simplicity, in this paper we consider the case where / is (or at least well 
approximated by) a linear function of age. After estimating / we can recover 
the associated residuals {e^i = 1, . . . ,n}. Assuming that / correctly cap- 
tures the dependence between the risk factor level and age, we can assume 
that these residuals are independent of age. 

If we have multiple risk factors, p say, we simply repeat the procedure 
outlined above, marginally for each risk factor, to obtain residuals for each 
age- varying risk factor, = (ej, . . . , ef), for each individual i — 1, . . . , n. 

One could view this stage of the procedure as defining individuals by their 
percentiles in the age-specific risk factor distributions, which are likely to be 
more stable over age and time than the risk factors themselves. 



In the second stage of the procedure, we carry out the desired time to event 
regression. We suggest here proportional hazards regression or AFT regres- 
sion, but any time to event regression methods would be valid. Instead of 
using the age-varying risk factors as covariates, we use their residuals from 




(7) 



3.2 Stage 2 
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Stage 1. Age is taken as the time scale. 

For AFT, the model is log-linear for a scale parameter Aj, given by 



where Xi is the vector of fixed risk factors, are the residuals of the age- 
varying risk factors derived in stage 1, and the /3s are the unknown parameters 
to be estimated. Typically the scale parameter Aj is equal to, or at least 
in proportion to, the failure age Sj. There is also a shape parameter that 
is usually assumed to be fixed. Various (strictly non-negative) parametric 
baseline distributions are generally considered for AFT regression, such as 
gamma, Weibull, log-logistic, log-normal, Gompertz and extreme value. The 
distribution of the error term e, depends on the choice of baseline distribution. 
In this case we call the entire procedure is called residual accelerated failure 
time (RAFT) regression. 

A proportional hazards model is log-linear in the hazard function of the 
i th individual h^. The baseline hazard, /i , may or may not be specified 
parametrically. This model can be written as 




(8) 




(9) 



where the terms £1X6 cLS for the AFT model. In this case we call the entire 
procedure is called residual proportional hazards (RPH) regression. 
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4 Examples 

In this Section we compare the predictive ability of three different AFT-based 
regression methods: 

1. Age as a covariate, time is measured as time from observation (denoted 
AFT- AC). 

2. Age as time, age- varying covariates not adjusted for age (denoted AFT- 
NA). 

3. Age as time, age- varying covariates adjusted for age by taking residuals 
from a regression on age (denoted RAFT). 

We focus on the comparison of AFT models: since AFT is a log-linear 
model for a parameter of the presumed distribution, they retain the orig- 
inal distributional form. Proportional hazards models, on the other hand, 
only retain their original distribution if that distribution possesses the pro- 
portional hazards property. The three distributions we consider here are 
Weibull, log-normal and log-logistic. When a Weibull distribution is as- 
sumed, proportional hazards models and AFT models are equivalent. 

We take a s our data the US Health and Nutrition Ex amination Survey 



(NHANES-I) ([National Center for Health Statistics 



19731 ). and the outcome 



we are interested in is the time to the first non-fatal myocardial infarction 
(NFMI). Accelerated failure time regression models are fitted to the data, us- 
ing the three baseline distributions, in each of the three paradigms discussed 



Modelling time to event with observations made at arbitrary times 



19 



above. 



The predictive ability of the models is compared by splitting the cohort 
randomly in half, estimating the models on the first half (training data), 
and attempting to predict failure ti mes in the s econd half (validation data). 



Specifically, we use the Brier score ((Brier 



19501 ) as the measure of predictive 



ability of the mod els. Other measures of predictive ability of a survival model 
are considered in iKorn fc Simon! ( 119901 ). We discretize time into years. Let 



n be the number of individuals. Let s$ denote the time in years at which 
individual i is first observed. For the AFT-AC model, Sj = 1 for all i, whilst 
for the other two models Sj is the age at first observation, denoted aj. Let 
ti = min(cj, M), where q denotes the time in years at which individual i is 
censored (with q = oo if censoring does not occur), and M is the maximum 
time in years at which observations can be made. For the AFT-AC model, 
M is the maximum follow-up time over the i individuals, whilst for the other 
two models M is the maximum age reached by an individual whilst still alive 
and not lost to follow-up. The outcome of interest is whether NFMI has 
occurred. Therefore, let 

1 Individual % has had NFMI by time j 



otherwise. 
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Let gij be the predicted probability from the model that NFMI has occurred. 
This is given by 




F(j) for AFT- AC 

F Ci+°0-J\°0 for AFT-NA and RAFT, 



where F is the cumulative distribution function of the assumed distribution. 
Finally, let N = ^" =1 (tj— s*). Then we calculate the Brier score for predictive 
ability as 

j n tj 
i=l j= Si 

A model with good predictive ability should produce a small Brier score. 

We repeat this procedure 100 times, with random splits each time into 
the training and validation set, and report as the final Brier score the average 
of the 100 runs. The results are given in Table [TJ 

[Table 1 about here] 

Of the three methods, AFT-NA gives the largest Brier score in all sit- 
uations. This is not surprising, since this method makes no correction for 
the age of the participant at observation. AFT-AC gives a moderate Brier 
score; this method corrects for age, but in a manner that has been shown, in 
Section El to be incoherent. RAFT always gives the lowest Brier score, so is 
the optimal model of the three for this data; this model corrects for age in 
a coherent manner. It is also noteworthy that there is stability in the Brier 
scores for RAFT across the three distributions that have been considered 
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5 Discussion 

We have shown that measuring time from a baseline that consists of an 
observation of an individual, but no intervention, is not sensible. In such 
situations, age should always be chosen as the time scale. On the other 
hand, it can make sense to measure the time from a baseline that consists 
of an intervention, since in this case it is entirely reasonable that inequality 
(J2J) is violated; for example, in randomised controlled trials. The modeller 
should, however, be aware that inequality (J2]) is violated in this case, and 
think carefully about whether this is reasonable. 

We propose new methods called residual proportional hazards (RPH) and 
residual accelerated failure time (RAFT) regression. These are two stage pro- 
cedures: the first stage regresses age-varying risk factors on age to eliminate 
the effect of patients entering the study at different ages; the second stage 
carries out time to event regression (either AFT regression or proportional 
hazards regression), replacing the age- varying risk factors by their residuals 
derived from the first stage. 

The additional predictive capability obtained from this two-stage pro- 
cedure have implications for communicating risk to patients. Existing risk 
scores, commonly based on AFT-AC (AFT with age as a covariate, and the 
time origin as time of first observation) are shown in Section H] to have weaker 
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predictive ability, and in Section [2]to lead to logical contradictions in the risk 
score assigned. Therefore, we believe that risk scores based on the methods 
outlined in this paper will lead to improved mortality estimates, which will 
enable better decisions to be made, both at the policy level and the individual 
patient level. 

Epidemiologists often refer to three principal sets of factors: time; place; 
and person. We note that RAFT handles two dimensions of time in a way 
that is analogous to the multi-stage regression method s that the field o 



Geostatistics employs to handle two dimensions of space ( IDiggle fc Robeiro 



20071 ) 



Extensions to RAFT regression can be considered. First, in this paper we 
have considered only linear forms for the function / in Equation Clearly 
this will often be unrealistic. Non-parametric estimates of this function could 
be considered, for example. In addition, one may wish to consider interac- 
tions between the age-risk factor relationship for multiple age-varying risk 
factors. This would involve replacing a series of marginal models of the form 
given in Equation Q by a joint model. 

Second, the estimation of / is prone to a 'survivor bias'. Subjects with 
higher values of risk factors tend to die younger, hence subjects recruited at 
an older age have probably had lower risk factor trajectories, on average, than 
those recruited at a younger age. One could correct for this by constructing a 
procedure that iterates between Stage 1 and Stage 2 of the method outlined in 
this paper. An estimate of the risk factor effect on survival can be estimated 
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from Stage 2, which can then be passed back to Stage 1 to account for this 
bias in estimating /; then the risk factor effects re-estimated in Stage 2 based 
on the new residuals, and so on. Note that this would also require the risk 
factor effect on deaths from other causes (causes not of interest) and other 
censoring mechanisms to be estimated in the second stage. 

Finally, we have not considered the propogation of uncertainty from the 
first stage of the RAFT regression procedure to the second, which may lead 
to standard errors being underestimated. 
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Table 1: Mean Brier scores 

Baseline AFT-AC AFT-NA RAFT 

Weibull 0.0217 0.0605 0.0149 

Log-Normal 0.0221 0.0384 0.0148 

Log-Logistic 0.0160 0.0180 0.0148 



